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Abstract — Channel coding alone is not sufficient to reliably 
transmit a message of finite length K from a source to one 
or more destinations as in, e.g., file transfer. To ensure that no 
data is lost, it must be combined with rateless erasure correcting 
schemes on a higher layer, such as a time-division multiple access 
(TDMA) system paired with automatic repeat request (ARQ) 
or random linear network coding (RLNC). We consider binary 
channel coding on a binary symmetric channel (BSC) and q-ary 
RLNC for erasure correction in a star network, where Y sources 
send messages to each other with the help of a central relay. 
In this scenario RLNC has been shown to have a throughput 
advantage over TDMA schemes as R ' — > oo and q — > oo. In this 
paper we focus on finite block lengths and compare the expected 
throughputs of RLNC and TDMA. For a total message length 
of K bits, which can be subdivided into blocks of smaller size 
prior to channel coding, we obtain the channel coding rate and 
the number of blocks that maximize the expected throughput 
of both RLNC and TDMA, and we find that TDMA is more 
throughput-efficient for small message lengths K and small q. 



I. Introduction 

Random linear network coding (RLNC) has recently been 
shown to improve network performance in several broadcast 
and multicast scenarios. For example, considering packet era- 
sure channels on the link layer, RLNC is known to improve 
throughput and reduce delay for wireless broadcast fll]-|]4]. 
Further, in (31 the joint design of network coding and medium 
access control protocols was considered. 

In contrast to the above work, we consider the joint design 
of channel and network coding. We assume that the size of a 
block is not predetermined and, for a finite message length K, 
the sources in a network may choose the number of data blocks 
so that the throughput of the overall system is maximized. 

The joint design and optimum rate allocation between 
channel and network coding for the block fading channel has 
been investigated in 151-171, where the tradeoff between the 
two schemes is analyzed as the block length on the physical 
layer gets large and the probability of block erasure is given 
by the outage probability of the block fading channel, under 
the assumption that the coherence time of the fading channel 
grows with the block length. 
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Fig. 1. Star network in which Y sources communicate over noisy BSCs 
with the help of a central relay. 



Joint error and erasure correcting coding for finite message 
lengths was analyzed in ISl- flOl . In the authors bound 
the performance of random coding on the physical and link 
layer using error exponents to trade off system throughput 
and delay. In |9) the combination of RLNC and continuous- 
time orthogonal waveform channels was investigated. Both 
papers aim to maximize throughput given a maximum delay 
constraint. By contrast, in this paper we do not enforce a 
maximum delay constraint, but focus instead on the expected 
throughput for reliable communication, assuming the senders 
continue to transmit until the receivers have correctly received 
the entire message as in, e.g., file transfer. Thus we use the 
expected throughput of the network as the performance metric 
and compare it to a TDMA system using ARQ. 

We consider a star network as depicted in Fig. Q] With the 
help of a central relay, Y sources, Si,..., Sy, communicate 
with each other over noisy binary symmetric channels (BSCs). 
We assume there is no direct path between any of the sources, 
i.e., they are only connected to the central relay, which receives 
transmissions from all sources and can broadcast to all sources. 
We consider the case where each source Si has a message of 
finite length K bits that is intended for all the other Y — 1 
sources Sj, j = 1, . . . , Y, j ^ i. 

In this setting, channel coding alone is not sufficient to guar- 
antee reliable communication. To ensure that no data is lost, 
channel coding on the physical layer must be combined with 
rateless erasure correcting schemes, such as a time-division 
multiple access (TDMA) system paired with automatic repeat 
request (ARQ) 0j] or g-ary RLNC 02), where RLNC was 
shown to be asymptotically optimal, as K — > oo and q — > oo, 

We define the time that it takes to transmit one bit as a 
time unit and, when maximizing the expected throughput, 
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we minimize the expected number of time units it takes to 
successfully transmit Y messages from Y sources to the other 
Y — 1 sources. 

More specifically, we aim to answer the questions: 

• Given RLNC over GF(q) and a message of length K bits 
at each source, what is the number of blocks m that the 
sources should use to transmit so that the expected system 
throughput is maximized? 

• What is the channel coding rate for each individual block 
that maximizes system throughput? 

• How does the throughput of RLNC compare to the 
throughput of TDMA as a function of the number of 
blocks m and the Galois field size ql 

Our goal is to jointly find the number of blocks and the channel 
coding rate that maximizes system throughput. Choosing a star 
network as a model allows us to combine several prominent 
features of more general networks. For example, for Y = 2 
sources, the star network reduces to a two-hop line network 
with a relay where the two ends communicate with each 
other. Additionally, in the RLNC case, the star network model 
includes a multiple-access channel (MAC) phase, where all the 
sources simultaneously transmit to a central relay, followed by 
a broadcast phase, where the relay transmits to all sources, 
as illustrated in Fig. Q] We first analyze these two phases 
separately before combining them to maximize the throughput 
of the star network. 

In our analysis, we take the coding overhead of RLNC into 
account. Similar to other rateless coding schemes ifPfl . ||T5l . 
RLNC over a finite number of blocks m and GF(g) exhibits a 
coding overhead, i.e, a receiver on average needs to correctly 
receive more than m blocks to be able to decode. Note that the 
coding overhead is a property of the code itself and is different 
from the signaling overhead, which is usually appended to the 
data in a block header. 

II. System Model 
A. Star Network Setup 

We consider a star network where Y sources Si, ... , Sy 
communicate with each other with the help of a central relay 
as shown in Fig. [T] 

Using RLNC, data transmission is divided into two phases, 
the MAC phase, where all the sources simultaneously transmit 
to the relay, and the broadcast phase, where the central relay 
transmits to all sources. 

As shown in Fig. |2] a source Si, i = 1, . . . , Y, splits its 
message of length K bits into m binary data blocks Dij, j = 
l,...,m, of length K/m bits, or equivalently m q-ary data 
blocks Dij, j = 1, ...,m, of length K/(ml) q-ary symbols. 
We assume that q is a power of two, i.e., q = 2 l and that K 
is divisible by ml. 

A source Si then performs RLNC on its m data blocks to 
create a network coded block Bo, by choosing a vector of 
length m of coefficients from GF(q), where the index b does 
not have a fixed range, since as many blocks are created as 
are necessary to achieve reliable communication. The coded 
block Bit, is then the linear combination of the m data blocks 
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Fig. 2. A source Si divides its message of length K bits into m blocks. 
RLNC over GF(q) is then used to create g-ary network coded blocks B ib . 
After q-ary to binary conversion, a header of size h bits is appended to each 
block and the resulting block of size k = K/m + h bits is protected by a 
linear channel code of rate R to create binary channel coded blocks v^. 



multiplied by the corresponding components of the coefficient 
vector aba, i.e., 



(1) 



which can also be represented as a binary block of length 
K/m bits using the notation B^. 

A header of constant size h bits is then appended to each 
coded block Bn, to form a channel input block B^ of length 
k = K/m + h bits. The header can, for example, contain a 
cyclic redundancy check (CRC) to detect decoding failures. 
Finally, each channel input block Ba is protected by a binary 
channel code of rate R, forming the channel coded block v^. 

a) MAC phase: During the MAC phase, all sources 
transmit to the relay simultaneously. We model the channel 
from the sources to the relay as a binary adder channel [16 j- 
lfT8l . so that the relay receives a value equal to the (real) 
sum of the bits sent by the sources plus a noise termQ. The 
relay then quantizes each received value to the nearest integer 
and makes a hard decision. If the quantized value is even, it 
decides a received zero, and if the quantized value is odd, it 
decides a received one, so that the resulting received bit can 
be modeled as the modulo-2 superposition of the bits sent 
by all the sources plus a noise bit. Equivalently, the received 
superimposed vector at the relay is given by 



n = v b © e = vib © v 2 fc > 



(2) 



where © symbolizes modulo-2 addition and e is a binary 
vector whose elements are Bernoulli i.i.d. random variables 
with probability p mac . This is an extension of the well known 
two-user binary adder channel model and is equivalent to 
sending vib©V26© . . .©vyb over an i.i.d. memoryless binary 
symmetric channel (BSC) with crossover probability p mac . 
Since we are using linear codes, the modulo-2 superposition 
of valid codewords results again in a valid codeword v b , 
which the relay attempts to decode. If the relay is able to 

'For simplicity, the binary adder channel model assumes that the sources 
transmit on-off pulses, the relay accumulates the total energy received from 
all sources in each time slot, and a common clock synchronizes all sources. 
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decode, it broadcasts to the sources. Note that the relay 
does not perform any network coding; it only decodes the 
superposition of the channel coded blocks from the sources. 
Should the relay not be able to decode, it does not transmit. 
We assume the sources can sense the channel, so if the relay 
fails to decode and does not transmit, the sources immediately 
transmit another channel coded block and we have another 
MAC phase. 

b) Broadcast phase: During the broadcast phase we 
assume that the relay is connected to each of the destinations 
via independent BSCs with crossover probability pbr- Note 
that, since we are considering finite block lengths, independent 
BSCs can lead to some sources being able to decode the 
message from the relay, while others fail to do so. We assume 
that the sources are at about the same distance from the relay, 
and thus experience the same path loss, so that they share a 
common channel crossover probability^ 

Each channel coded block Vb sent by the relay during 
the broadcast phase is a linear combination of Ym data 
blocks, multiplied by a corresponding set of Ym network 
coding coefficients. To be able to decode, a source must know 
the network coding coefficients at, that were used to create 
each superimposed block sent by the relay. One method of 
letting the receivers know a& is to add the coefficients to 
the header information. Another way, which we adopt in this 
paper, is to assume that the sources and the receivers use Y 
synchronized pseudo-random number generators, each source 
with a different seed, that generate the sequences for a?,. 

The column vector of Ym network coding coefficients 
at = [aib, . . . , ayb]' corresponding to a block b is the bth 
column in the generator matrix G employed by the RLNC in 
the star network, and Bb = X^ili Bib> tne superposition of 
the network coded blocks, can be viewed as a code symbol of 
the RLNC. 

When a source Si receives a superimposed channel coded 
block Vf, from the relay, it first decodes the binary channel 
code to obtain the modulo-2 superposition of the channel input 
blocks Bjj = Y^a=i Bib- If decoding is successful, as indicated 
by the CRC in the header, the header of size h bits is removed 
and, after binary to q-ary conversion, the <j-ary superposition 
of the network coded blocks Bb is obtained. Source Si then 
subtracts its own contribution to Bb, which is Bn, and stores 
the superposition of the other Y — 1 network coded blocks 
Bjb, j = 1, • • • , Y, j i, as an element in a vector of received 
RLNC symbols. It also stores the subset of (Y — l)m network 
coding coefficients in a?, involved in creating the superposition 
Bjb as a column in its coefficient matrix G^, the perceived 
generator matrix of the RLNC from the point of view of source 
Si. Note that the rows and columns of Gj, a subset of the 
matrix G, do not contain information about blocks that were 
not correctly received by the relay or by source 5,;. 

Once a source Si has received enough blocks from the relay 
to form a matrix G^ with (Y — l)m linearly independent 
columns, it can recover the (Y — l)m data blocks from the 
other sources by inverting the matrix Gj and multiplying it by 

2 The analysis would still be possible, but more tedious, if the channels had 
different crossover probabilities. 



its vector of received RLNC symbols. On average, a source Si 
needs to collect more than (Y— l)m correctly received blocks 
to form a G^ of rank (Y — l)m, and in Subsection III-DI we 
bound the expected overhead of RLNC for finite size Galois 
fields. 

Once a source Si has collected enough blocks to decode the 
RLNC, it sends a single acknowledgment (ACK) to the relay. 
Once the relay has collected Y ACKs from the Y sources, it 
broadcasts an ACK to the sources, terminating transmission. 
All sources continue to transmit until they receive an ACK 
from the relay. We assume that the transmission of an ACK is 
instantaneous and reliable, i.e., that it does not consume any 
resources and it is never received erroneously]^ 

As a reference scheme we consider TDMA transmission of 
the sources, paired with ARQ. We also assume a source splits 
its message into m data blocks, but no network coding is used. 
The MAC phase in Fig. Q] is replaced by a TDMA phase, 
where only one source transmits to the relay at a given time 
and the individual data blocks are again protected by a binary 
channel code of rate R. The transmitting source Si repeats 
the transmission of a channel coded block as many times as is 
necessary for the relay to receive the data block correctly, at 
which point the relay transmits an ACK. After the relay has 
received the data block correctly it broadcasts it to all sources. 
When a source receives the data block correctly, it sends an 
ACK to the relay. The relay repeats the broadcast transmission 
as many times as is necessary until all Y — 1 sources Sj, 
i = 1, . . . , Y and j ^ i, receive the data block correctly. After 
the steps described above have been successfully completed 
for source Si, it is the turn of the next source to transmit 
a data block to the relay, and the sources are scheduled in 
a round robin fashion with m rounds. After each source has 
successfully transmitted m data blocks, the transmission ends. 

B. A Motivating Example 

Consider the case where the error probability on all BSC 
links is zero, i.e., p mac = pbr = 0, and RLNC is performed 
over an infinitely large Galois field. Furthermore, let m = 1 
and the header size h = 0. Each of the Y sources has a 
message, e.g., a file of size K bits to transmit to the others. 
Using the TDMA scheme, for every one of the Y sources, 
there is a phase where the source transmits K bits to the relay 
followed by a phase where the relay broadcasts K bits. The 
average throughput of the TDMA scheme is thus given by 

1 

2' 

For the RLNC scheme, an individual source must collect Y — 
1 blocks of K bits in order to be able to decode, and the 
throughput is given by 

YK 

Trlnc - 2{Y _ 1)K - 
The RLNC scheme thus achieves a throughput gain of 

Jrlnc y 
ttdma y — 1 



T 



TDMA 



(3) 



3 We assume the length of the ACK is negligible compared to the length of 
the message and that it is protected by a more powerful error- correcting code 
than the message itself. 
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over the TDMA scheme. The gain of RLNC is largest when 
only 2 nodes exchange information and decreases to one as 
the number of nodes in the star network gets large. We now 
describe the channel and network coding in more detail. 

C. Channel Coding 

We consider random coding on the physical layer and use 
two different approaches to bound the performance of channel 
coding. 

1) The block error probability e of random coding on the 
BSC with a code rate R can be bounded using the 
random coding error exponent E{R): 

e <2- nE ( R \ (4) 

where n = k/R is the block length of the code and 
k = K/m + h bits. Using the union bound, the random 
coding error exponent for the BSC is given by ||T9l 



E(R) — i?o — R) 



(5) 



where Ro, the cutoff rate of the channel, depends on the 
crossover probability p of the BSC and is given by 



Ro = - log. 



Above the so-called critical rate R cr it, a tighter upper 
bound on the block error probability is obtained by 
using the sphere packing exponent. However, the union 
bound is often used to approximate the performance of 
codes of practical length, and hence we adopt the simple 
form of dU, which also allows us to obtain analytical 
expressions for the optimum channel coding rate and 
optimum number of data blocks. We use the above 
method to bound the performance of channel coding in 
Sections MN1 
2) Tighter bounds on the achievable channel coding rate 
given a block error probability e have been derived in 
lEOl . in the following referred to as the PPV bound. 
The relationship between the achievable code rate R, 
the error probability e, the length of the channel code 
n = k/R, and the BSC crossover probability p can be 
written as 



R = C 



where 



p(l -p) 



log 2 



1-p 



<2 -1 0) 



log 2 (n) 



27! 



(6) 



C = 1 - H(p) 



is the channel capacity of the BSC, M(x) — 
—x log 2 (2:) — (1 — x) log 2 (l — x) is the binary entropy 
function, 

Q(x) = -f= I e~ u2/2 du 



/2tt 

is the tail probability of the Gaussian distribution, and 
Q~ 1 (x) is its inverse. In contrast to the first approach 
described above, using the PPV bound allows us to 
consider code rates up to the channel capacity. We use 



q 4 



» j -J, ." 1 1' iwuwi 



q = 64 



Number of blocks m 



Fig. 3. Expected overhead X(m, q) (markers) in blocks with the upper (solid 
lines) and lower (dashed lines) bounds on the expected overhead in blocks. 



the above method to bound the performance of channel 
coding in Section [VT] and compare the results to those 
obtained in Sections HUHY] 

D. The Expected Overhead of Random Linear Network Cod- 
ing 

In this subsection we bound the expected coding overhead 
of RLNC in the star network. As depicted in Fig. [2] each 
source constructs a random linear network code over m data 
blocks before sending a network coded block to the relay. 
Considering a single source on its own and RLNC over GF(g), 
the probability that m + x independently created column 
vectors of network coding coefficients a form an m x (m + x) 
matrix of rank m, i.e., the probability that m + x network 
coded blocks are sufficient to decode the RLNC of that source 
is given by ETI 



-Psuccoss(m, x, q) = JJ (1 - q 



(7) 



In the star network, a block broadcast by the relay is a 
linear combination of Ym data blocks and every source can 
reduce the problem of decoding the network code to that of 
decoding the (Y — l)m unknown data blocks by subtracting 
out its own data. Then, since the network coding coefficients 
are chosen independently at all sources, the probability that all 
Y sources can construct an invertible matrix of rank (Y — l)m 
from (Y — l)m + x correctly received blocks is given by 



-^success ( m i X i 9' — {P&m 



(Y-l)m 



((Y- l)m,x,q)Y 

Y 



n 



(8) 



We now use a result from lETI to bound (|7]i as 
1 



1 



} (m,x,q) <l-q~ 



(9) 



which can be used to derive upper and lower bounds on 
the expected overhead of RLNC in the star network that are 
independent of the number of data blocks m. Using (O and iQ, 
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the probability P*(m, x = i, q, Y) that overhead x = i blocks 
is required to decode in the star network is upper bounded by 

P* (m,x = i, q, Y) = P* ucccss (m, i, q, Y) 

~ Success ("M - M> y ) 



< 1 - q 



-i + l 



. ^--)q- Tl . (10) 

3/ \(q-W q 3 ] 



The expected coding overhead X* (m, q, Y) of RLNC in 
blocks in the star network is thus upper bounded by 



X*(m,q,Y) =J2iP*(m,x = i,q,Y) 



< 



Y 

E 

Y 
j=l 



Y 



3 A ' V(9-1) J 



q> 



Y\ qV-(q-iy 

k) [ ' (g-l)i(grf-l) 



2 ^x*( 9 ,y). 



(11) 

In the same way, we can lower bound the expected overhead 
of RLNC in blocks as 



x-( m , g ,y)>|:Q(-i) W ( fa\- )j ^.j l) , <i2) 



Both bounds ( TTTT i and ( TT2b are independent of the number of 
data blocks m and tend to zero as the size of the Galois field 
gets large. 

Fig. [3] shows the actual expected overhead for RLNC of a 
single source for several Galois field sizes q compared to the 
upper bound (fTTT i for Y = 1, displayed as solid lines, and 
the lower bound dl2l . displayed as dashed lines, where the 
expected overhead X(m,q) of RLNC in blocks is given by 

ED 

m ^ 

*K<?) = £— (13) 

Although (frJt is not independent of the number of data blocks 
m, Fig. [3] shows that the expected overhead in blocks is well 
approximated by a constant fractional number of blocks. The 
larger the Galois field size q, the better the performance of 
RLNC, and the quicker the expected overhead converges to a 
constant. As q increases, the upper and lower bounds on the 
expected overhead become tighter, and for q = 64 they are 
almost indistinguishable. 

Fig. 2] shows the upper ( fTTT i and lower (fT~2T > bounds on the 
expected overhead of RLNC in blocks for different numbers of 
sources Y. As the number of sources Y increases, the expected 
overhead of RLNC increases as well. 

Modeling the expected coding overhead of RLNC as a 
constant fractional number of blocks leads to opposing op- 
timization criteria for channel coding and RLNC when a 
message of finite size K bits is divided into m data blocks: 

• More data blocks, and thus shorter channel coded blocks, 
lead to a smaller coding overhead of RLNC in bits. 




2 3 4 

Number of Sources Y 



Fig. 4. Upper (solid lines) and lower (dashed lines) bound on the expected 
overhead of RLNC in blocks for different numbers of sources Y. 



• Longer channel coded blocks, and thus fewer data blocks, 
lead to more powerful channel codes. 

In the following, when investigating the optimum number of 
data blocks m and the optimum channel coding rate R, we first 
consider the MAC phase and the broadcast phase separately 
before finding the values that jointly maximize throughput for 
the star network. 

III. The MAC Phase 

In this section we optimize the throughput for the MAC 
phase and do not consider the broadcast phase in the opti- 
mization. To this end, assume that the channels from the relay 
to the sources are error-free, i.e., pbr = 0, so that the relay 
does not need a channel code, and that the relay removes the 
h header bits prior to broadcasting. (When p^ r = 0, appending 
a CRC to detect decoding failures is not necessary.) 

Using random coding error exponents, we obtain the chan- 
nel coding rate R and the number of data blocks m that 
minimize the expected number of transmissions at the sources 
and thus maximize the throughput. We then compare the 
results for RLNC to the optimum rate and number of data 
blocks for TDMA. 

Modeling the expected coding overhead of RLNC as a 
constant fractional number of blocks (fTTT i. on average each 
source must collect (Y — l)m + X*(q,Y) network coded 
blocks to be able to decode, and the expected number of 
channel coded blocks that the sources need must transmit is 
thus given by 

(Y-l)m + X*(q,Y) 



jrrnac 
JU RLNC 



1 



(14) 



where e mac is the block error rate of channel coding for 
a BSC with crossover probability p mac . Using the union 
bound random coding error exponent to approximate the block 
erasure rate (3) and letting n = k/R= (K/m + h)/R be the 
size of a channel coded block in bits, we obtain from ( fT4b 

k((Y-l)m + X*(q,Y)) 



7\rmac 
iv RLNC 



R (1 - 2- k( ~ R o/ R - 1 )) 



(15) 



for the expected number of bits that must be sent by the 
sources. To minimize the expected number of bits sent, i.e., 




Fig. 5. Optimum number of blocks m given the message length K, the 
number of sources Y, and header size h = for RLNC over different Galois 
field sizes q. 



Fig. 6. Optimum number of blocks m given the message length K, the 
number of sources Y, and header size h = 16 for RLNC over different 
Galois field sizes. 



to maximize throughput, we use the partial derivatives of (IT3T > 
with respect to R and m to find the optimum channel coding 
rate and the optimum number of data blocks, respectively. 

For TDMA, a total of Ym blocks must be transmitted to 
the relay by the Y sources and we have 

Ym 



JW TDMA — 



1 



(16) 



Using equations ([Tol l and for transmission over a BSC with 
crossover probability p mac , we obtain 

Y(K + mh) 



JV TDMA ~ 



R 1 



_ 2 -(£+ fc )(4M) 



for the expected total number of transmitted bits. 



A. The Optimum Channel Coding Rate 

Taking the partial derivative of (JT5j with respect to R and 
setting it to zero, we obtain 

l-2- fc (%-i)-ln(2)fc^2- fc ( i ?f-i)=0, 
R 

where k = nR = K/ m + h is the block length before channel 
coding. Using the substitution t = In(2)fc^., we then obtain 

-(t+l)e-(* +1 ) =_ e -M2)fe+i ) 

which can be solved using the Lambert- W function W(x) 
given by 

x = W(x)e wix K 

The optimum channel coding rate as a fraction of the cutoff 
rate of the channel is then given by 

R -ln(2)fc 



Ro W_i (- e -(in(2)fc+i)) + i' 



(18) 



where W_i(a;)represents the lower branch of the Lambert-W 
function J23). Q From (HU we see that the optimum channel 

4 For negative arguments, the Lambert-W function has two solutions. Since 
the ratio R/Ro must be between zero and one, we require W(x) < — 1, so 
the solution must be on the lower branch of the Lambert-W function. 



coding rate ratio R/Ro is only a function of the block length 
k and is independent of the expected overhead X* (q, Y) of 
RLNC and the number of sources Y. It is thus also the 
optimum channel coding rate for a scheme employing TDMA. 

To evaluate the Lambert-W function we use the closed form 
approximation 



W_i(x) « ln(-x) - 



1 



1 - 



1 



1 



Aty/rj/2 



(17) where 



1— A 2 cr exp{— As^/a} 

a = — ln(— x) — 1, 



At = 0.3361, A 2 = 0.0042, and A 3 = 0.0201. The 
approximation has a maximum relative error of only 0.025%. 
Using the approximation, we see that, as the block length k 
increases, the optimum channel coding rate ratio R/Ro tends 
to 1. 

B. The Optimum Number of Blocks 

Now taking the partial derivative of (fT5t with respect to m 
and setting it to zero, we obtain 



1 



H2)zK (g + h) (X*(q, Y) + m(Y - 1)) 
KX*(q,Y) - hm 2 (Y- 1) 



(19) 



where z = (Rq/R) — 1. In general, a closed form solution 
of (fl~9b cannot be found. However, for h = and Y = 2 we 
can again use the Lambert-W function to solve for m, and the 
optimum number of blocks m, given a constant R/Ro and the 
message length K, is 



-]n(2)zK 



(20) 



1 



To obtain the optimum number of blocks m that minimizes 
the expected number of transmissions and maximizes the 
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throughput, we solve ( fT9T > and ([T"8T i jointly using numerical 
methods. For h = 0, Fig. [5] shows the optimum number of 
blocks m given a message length isT, the number of sources 
Y, and RLNC over GF(q). As the total message length K in- 
creases, we observe that the maximum throughput is achieved 
for a larger number of blocks m. Since the expected coding 
overhead X* (q, Y) (ITTb in blocks increases with the number 
of sources in the star network, the optimum number of blocks 
m increases with Y for a fixed message length K. On the other 
hand, since the expected coding overhead X*(q, Y) decreases 
with increasing Galois field size, the optimum number of 
blocks decreases with q. For h = 16, the optimum number 
of blocks is shown in Fig. [6] and we see that by increasing h 
and (using a longer header per block) the optimum number of 
data blocks decreases. 



C. Large Galois Field Considerations 

A common assumption in the analysis of network coding is 
that RLNC is done over a sufficiently large Galois field size 
that the coding overhead is negligible, i.e., X*(q,Y) w for 
large q. 

If we set X*(q, Y) = in (1151 1. the numerator is increasing 
in m, while the denominator is strictly decreasing in m. So 
the smallest possible m, i.e., m = 1, minimizes the expected 
number of transmissions and maximizes throughput. Thus, 
in the absence of a coding overhead, i.e., for q — > oo, the 
optimum strategy for the source is to use a channel code on 
the whole message and not divide it up into smaller blocks. 

The same argument holds for (fTTt and TDMA. If 
X*(q, Y) = 0, throughput is maximized if the sources choose 
m = 1, i.e., the longest (and therefore strongest) possible 
channel code. 

IV. The Broadcast Phase 

In this section we optimize the throughput of the broadcast 
phase without taking the MAC phase into account. To this 
end assume that the channels to the relay are error free, i.e., 
Pmac = 0. Further, since p mac = 0, we assume that during the 
MAC phase the sources transmit to the relay uncoded, i.e., 
R = 1, and that no header is used. A header of length h is 
then appended to each block at the relay, and the relay uses a 
channel code of rate R < 1 to protect the blocks. 

A. TDMA Broadcast Paired With ARQ 

We first consider the TDMA scheme, i.e., broadcast using 
ARQ, where every block is repeated by the relay until all the 
Y — 1 sources that do not know a given transmitted message 
have received it correctly. 

Then the expected number of blocks that the relay must 
broadcast is given by 



7Lf br 

* J TDMA 



FmVl-(l 



Y-l 



(21) 



i=0 



where is the block error rate of a BSC with crossover 
probability pbr- Using and (0 we obtain for the expected 
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Fig. 7. Optimum channel coding rate R for broadcast (solid lines) and 
considering transmission to and from the relay jointly (dashed lines) as a 
fraction of the cutoff rate R.q for different channel input block lengths k. 
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(22) 



For any fixed coding rate R, the factor Y (K + mh) /Rin (l22l 
as well as the block error probability et, r are strictly increasing 
with increasing m. So the throughput for the TDMA system 
paired with ARQ is maximized for m = 1 and a channel input 
block of size k = K + h. 

To obtain the channel coding rate that maximizes the 
throughput, we transform (OH into the finite sum 



Y-l 



M, 



TDMA 



i=l 

and using (0]i and © we obtain 



Y - 1\ Ym 



(23) 



' br 



N- 



TDMA 



£(-i) 1+1 



Y 



Y(K + mh) 



R 1 



(24) 

for the expected number of bit transmissions. We use the 
partial derivative of (l24l w.r.t. R to obtain 



£(-!)' 



i=l 



y\ : 



-izk 



iHn(2)^2 



-izk 



(1 - 2- lzk f 



= 0, (25) 



where z = (Ro/R) — 1. 

For TDMA and Y = 2, the channel coding rate that 
maximizes throughput d25l l in the broadcast phase is the same 
as the rate that maximizes throughput for transmission to the 
relay d 1 8b . obtained in Section [HI] In both cases, messages 
are transmitted from one sender to one intended receiver. For 
larger Y, we can numerically find the solution of 1251 . and 
the optimum rate ratios R/Ro for broadcast from the relay 
for different numbers are destinations are shown as the solid 
lines in Fig. [7] We see that, while ( fT~8T > does not depend 
on the number of sources transmitting to the relay, during 
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Broadcast from the Relay, h = 16 
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Fig. 8. Optimum number of blocks for broadcast from the relay for h = 16. 



the broadcast phase the optimum channel coding rate R for 
TDMA as a fraction of the cutoff rate R decreases as the 
number of broadcast destinations increases and, for Y > 2, 
is smaller than (fT8l l. The optimum number of blocks for the 
TDMA scheme, however, is m = 1 for both transmission to 
the relay, considered in Section HID and the broadcast phase. 

B. Broadcast Using RLNC 

Using RLNC, the expected number of network coded blocks 
that the relay must broadcast is given by J3] 



Mrlnc =m' + J2 1 - 



j=m' 

- Y 

\P(m',j-m',q) 



(26) 



where m' = m(Y — 1) is the number of unknown blocks 
each node 5, must collect, and the probability of successful 
decoding given a received overhead in blocks is given by (|7). 
The expected number of bits that the relay must transmit is 
then given by 

h 

(27) 



/v br 



br 

RLNC- 



We solve the above multidimensional optimization problem 
using numerical methods. For the broadcast scenario using 
RLNC, Fig.[8]shows the optimum number of data blocks m for 
h = 16. Comparing the optimum number of blocks in Fig. [8]to 
the MAC phase displayed in Fig. [6] the number of blocks that 
maximizes throughput is generally smaller for the broadcast 
phase. The most prominent difference between Fig. [8] and 
Fig. [6] is that, while for the MAC phase the optimum number 
of data blocks increases with the number of sources, for the 
broadcast phase the optimum number of blocks decreases with 
an increase in the number of broadcast destinations Y, thus 
putting more emphasis on the channel coding being able to 
provide more reliable individual blocks. 

Fig. [9] shows the expected number of broadcast transmis- 
sions, obtained from ( l24t and d27| i, multiplied by the cutoff 
rate i? an d divided by the total number of message bits 
exchanged between the Y sources, YK. For RLNC, the 
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Fig. 9. Expected number of broadcast transmissions per message bit 
multiplied by the cutoff rate Ro for Y = 5 sources. 

expected number of broadcast transmissions decreases with the 
Galois field size q and increases with increasing block header 
size h. As the total message length K at each source gets large, 
the expected number of broadcast transmissions for RLNC is 
smaller than for TDMA, making it more throughput-efficient, 
but TDMA is more throughput-efficient for small message 
lengths. Asymptotically, the expected number of broadcast 
transmissions for RLNC converges to (Y — 1)/Y, and the 
convergence is faster for larger Galois fields. 

V. Joint Optimization for the Star Network 

From the results from Sections [HI] and [IV] we see that 
the number of data blocks and the channel coding rate that 
maximize throughput differ for transmission from the sources 
to the relay and for broadcast from the relay. In a practical 
system, however, it would be desirable to have the same 
channel coding rate and the same block size for transmission 
to and from the relayjf] 

In this Section we jointly optimize the throughput of the 
MAC phase investigated in SectionHlTland the broadcast phase 
investigated in Section [IV] We assume that during the MAC 
phase and the broadcast phase we want to use the same number 
of data blocks and the same channel coding rate, so that the 
individual blocks are of the same size. We refer to the time 
it takes to transmit one block as a time slot. For the RLNC 
scheme, using ( f26l ). the expected number of time slots that are 
occupied by transmissions in the star network is given by 



^RLNC — ^RLNC ' 1 



1 



Ed 



, )P(m',j - m',q,Y) 



1 



(28) 

where m' = m(Y — 1) is the number of unknown blocks 
a source Si must collect and e mac and efc r denote the block 

5 Note that, for a fixed message length of K bits, keeping the number of 
data blocks m the same is equivalent to keeping the block size constant at 
K/lm q-ary symbols. 




RLNC over GF(4) 



-h = 
■ h = 16 

h = 32 



1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 
Message length K 



RLNC over GF(64) 










16 


h = 


32 



1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 
Message length K 



Fig. 10. Average throughput ratio Trlnc/Ttdma for GF(4). 



Fig. 11. Average throughput ratio Trlnc/Ttdma for GF(64). 



RLNC over GF(64) 



erasure rate during the MAC phase and the broadcast phase, 
respectively. (f28T > relies on the fact that, for every block that 
the relay broadcasts, on average 1/(1 — e mac ) transmissions 
from the sources to the relay are necessary. 

Similarly, using ( [ToT l and (l23l l. for the TDMA scheme the 
expected number of time slots that are occupied by transmis- 
sions is given by 



AL 



TDMA 



_ ji jmac 
-■'"TDMA 

Ym 

1 ^mar 



"TDMA 
Y-l 



Y - 1\ Ym (29) 



1 



In this case, since the throughput for both the transmission 
phase to the relay and the broadcast phase from the relay is 
maximized for m = 1, one block of length k = K + h bits for 
each source Si is also optimum when considering both phases 
jointly. 

In the following, we consider the symmetric case, where 
£mac = £br, or equivalently p mac = PbJj- In this case, the chan- 
nel coding rate that maximizes the throughput for TDMA can 
be obtained by taking the derivative of -/V TDMA = &A/ TDMA 
w.r.t. the channel coding rate R, and the optimum rate is the 
solution to the equation 

Y 



1) 



Y\ 1 



-izk 



ifcln(2)^2 



-izk 



1 - 2 



— zk 



(1 - 2~ izk 



(30) 



(1 

(Ro/R) 



- z k 



o, 



with z = (Rq/R) — 1. The resulting channel coding rate 
that jointly maximizes throughput for the TDMA scheme star 
network is also depicted in Fig. [7] 

As noted in Section lPVl the optimum channel coding rate for 
TDMA transmission to and from the relay, obtained separately, 
is the same for Y = 2 sources. Thus, considering transmission 

6 We assume that the sources and the relay transmit at the same power level 
and that therefore the channels to and from the relay have the same crossover 
probability. If the transmit powers of the sources and the relay are variable, 
one could extend the present analysis to choose a power ratio such that, for 
a given channel coding rate, the same number of data blocks optimizes the 
throughput in both the broadcast and MAC phases. 
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Fig. 12. Optimum number of blocks m in the star network for GF(64). 

to and from the relay jointly, the optimum rate is also given 
by ( |T8b when Y = 2. For Y > 2, the optimum channel coding 
rate for the star network decreases with the number of sources, 
similar to the TDMA broadcast case. However, comparing the 
optimum rate obtained in Section [IV] for the broadcast phase 
alone to the jointly optimum rate obtained from (130b for the 
same number of sources Y, we find that the channel coding 
rate that jointly maximizes throughput for the star network is 
higher than the one that gives the maximum throughput for 
the broadcast phase alone. 

Figs. [TOl and [TTI show the average throughput ratio 

Trlnc 



TDMA 



M* 

TDMA 

JU RLNC 



of RLNC over GF(4) and GF(64) to TDMA, respectively, 
where d28T i and ( 1291 have been used to compute the ratio 
and the asymptotic throughput ratios are plotted as horizontal 
black lines. For GF(4) and small message lengths K, we see in 
Fig. [TO] that the average throughput ratio rises steeply before 
the curves flatten out and slowly approach their asymptotic 
value given by (0. As the block header size h increases, 
the average throughput ratio decreases, and a larger message 
length K is needed to obtain a given average throughput ratio. 
For small message lengths K and large header sizes h, TDMA 
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K= 10000; h = 16 
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Fig. 13. Optimum number of blocks m in the star network vs the crossover 
probability p of the BSC for K = 10000 and h = 16. 

is more throughput-efficient. For example, for RLNC over 
GF(4), h = 32, and Y = 6 sources, we require K > 900 
bits for RLNC to be more throughput-efficient than TDMA. 

Employing RLNC over GF(64) which, compared to RLNC 
over GF(4), decreases the expected coding overhead in blocks, 
we see in Fig. QT|that RLNC is more throughput-efficient than 
TDMA for all values of K and that the average throughput 
ratio converges much faster to its asymptotic value. For 
small values of K, there exists a region where the average 
throughput ratio is independent of the header size h. Above 
a certain message length K, however, the average throughput 
ratios for different header sizes h separate slightly, with the 
region of independence extending to larger message lengths K 
for larger values of Y. Comparing Fig. QT| to Fig. Q~2] which 
shows the number of data blocks that maximizes throughput 
for the star network with RLNC over GF(64), we see that the 
region where the average throughput ratio is independent of h 
coincides with the region of K values for which the optimum 
number of blocks is m = 1. 

VI. Bounding Channel Coding Performance Using 
the PPV Bound 

In this section we use the PPV bound given by (O to 
relate the channel coding rate R and block length n to the 
probability of block error e. Compared to the random coding 
error exponent bound obtained by using (0) and (0, where the 
largest possible channel coding rate is the cutoff rate, using 
© allows channel codes that are asymptotically able to reach 
channel capacity. 

The numerical results obtained using © to model channel 
coding in general show the same behavior as reported in 
Sections IITI14V1 We observe that the maximum throughput for 
RLNC is achieved for a larger number of data blocks m as the 
message length K increases and, for a fixed message length 
K, the optimum number of blocks decreases as the Galois field 
size of RLNC increases. In addition, the optimum number of 
blocks decreases as the block header size h increases in length. 

In contrast to the results in Sections IHTHVl however, where 
the optimum number of data blocks m did not directly depend 
on the crossover probability p of the underlying BSC and the 
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Fig. 14. Throughput ratio Trlnc/Ttdma for RLNC and different values 
of p. 

optimum channel coding rate could be expressed as a fraction 
of the cutoff rate Ro, using © we find that the number of 
data blocks m that maximizes throughput for RLNC varies 
with the crossover probability p. 

Fig. Q~3] shows the optimum number of data blocks m of 
RLNC for star networks with Y = 2 and Y — 6 sources 
versus the BSC crossover probability p mac = Pbr = P- The 
message length is K = 10000 bits, the header size is h = 16 
bits, and RLNC overGF(4), GF(16), and GF(64) is considered. 

Compared to the random coding error exponent approach 
(see Figs. [8] and [12}, using (O to bound the channel coding 
performance generally results in a smaller optimum number 
of data blocks. This implies that, when maximizing through- 
put, the tighter bounding approach places more emphasis on 
channel coding and low block error probabilities and less 
on reducing the coding overhead of RLNC. Furthermore, the 
optimum number of data blocks m decreases as the channel 
quality degrades, thus requiring longer channel coded blocks, 
which implies more powerful codes. 

The average throughput ratio Trlnc/Ttdma for star net- 
works with Y — 2, Y = 3, and Y = 6 sources, RLNC 
over GF(4) or GF(64), and header size h = 32 is shown in 
Fig. [14] The throughput ratios for p mac = pb r = p = 0.04, 
which corresponds to a BSC capacity of roughly C = 0.75 
bits/transmission, are plotted in solid lines, the throughput 
ratios for p = 0.11 (C = 0.5) are plotted in dashed lines, 
the throughput ratios for p = 0.21 (C = 0.25) are plotted in 
dash-dot lines, and the asymptotic throughput ratios are plotted 
as horizontal black lines. We see that the average throughput 
ratio decreases slightly with the BSC crossover probability 
p. Compared to the random coding error exponent approach 
(see Figs. [Tol and fTTT). using decreases the throughput ratio 
Trlnc/^tdmAi implying that employing stronger channel 
codes reduces the advantage that RLNC has over TDMA. 

Specifically, for RLNC over GF(4), h = 32, Y = 6 sources 
in the star network, and transmission over BSCs with p 
i 1.2 1 . we now require K > 1800 bits for RLNC to become 
more throughput-efficient than TDMA. In general the ratios in 
Fig. 03] approach their asymptotic values much more slowly 
than in Figs. [10] and [TT] and we also see a larger gap between 
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the throughput ratios employing RLNC over GF(4) and RLNC 
over GF(64). 

VII. Conclusions 

We analyzed the joint design of channel coding on the 
physical layer and random linear network coding on the link 
layer for a star network where Y outer sources send fixed 
length messages to each other with the help of a central relay. 
For RLNC over a finite Galois field of size q and messages 
of total length K at each source, we obtain the number of 
data blocks and the channel coding rate that should be used 
to maximize the throughput of the star network using RLNC, 
assuming binary symmetric channels between the sources and 
relay and a binary adder channel model at the relay. We also 
obtain the optimum number of blocks and the optimum rate 
for a reference TDMA system and compare the throughputs of 
the two transmission schemes. We find that, for small message 
lengths K and RLNC over small Galois fields q, TDMA is 
more throughput-efficient than RLNC, while RLNC is more 
throughput-efficient when the message length K gets large. 
We employ two different approaches to model the probability 
of channel decoding failure, a simplified random coding error 
exponent based on the union bound and the PPV bound for 
finite block lengths, where the PPV bound allows the use of 
more powerful, capacity achieving channel codes. We find that 
the average throughput ratio of RLNC to TDMA decreases 
using the PPV bound, implying that for finite block lengths, 
stronger channel coding reduces the advantage that RLNC has 
over TDMA. 
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